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ABSTRACT 



A method and system for distributing a service request 
among a plurality of servers is disclosed. A portion metric is 
assigned to each one of the plurality of servers, the portion 
metric designating a portion of total server requests to be 
allocated to the one of the plurality of servers. A server 
request may then be received. A total number of server 
requests processed by the plurality of servers is incremented 
and a number of server requests distributed to each one of 
the plurality of servers is maintained. The server request is 
then distributed to one of the plurality of servers using the 
portion metric assigned to each one of the plurality of 
servers, the number of server requests distributed to each 
one of the plurality of servers, and the total number of server 
requests. 

37 Claims, 6 Drawing Sheets 
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CROSS-REFERENCE TO RELATED 
APPLICATION 

This application is related to U.S. application Ser, No. 
09/175,673 filed on the same day entitled "A System and 
Method for Determining a Preferred Service in a Network", 
and U.S. application Ser. No. 09/175,516 filed on the same 
day and entitled "A System and Method for Information 
Retrieval Regarding Services". 

BACKGROUND OF THE INVENTION 15 

1. Field of the Invention 

The present invention relates generally to traffic redirec- 
tion in a distributed system. More particularly, the present 
invention relates to a method and apparatus for redirecting 20 
traffic among a number of servers using a portion metric for 
each server. 

2. Description of the Related Art 

A computer network may be defined as an interconnected 
collection of autonomous computers. In a distributed 
system, the existence of these multiple autonomous com- 
puters is transparent to the user. To achieve this 
transparency, allocation of jobs to processors and all other 
system functions must be automatic. These automated sys- 
tem functions are typically provided by an operating system. "^^ 

In general, the operating system hides the details of the 
hardware from the user and provides the user with a con- 
venient interface for using the system . More particularly, the 
operating system is responsible for allocating resources 
within the distributed system and schedules the execution of 
various services accordingly. Thus, the operating system 
selects the best processor, locates and transfers all corre- 
sponding appropriate location. In this manner, the operating 
system ensures that system resources such as file servers are 
used in an eflficient manner. 

The resource allocation provided by the operating system 
includes the retrieval and processing of data. Often, this data 
is stored on one or more shared file servers. Users in such a 
system are called clients. Communication from a client 45 
generally comprises a request message asking for a particu- 
lar service to performed. The service request message is then 
sent to an appropriate server. The server then does the work 
requested and sends back a reply. Thus, data is accessed and 
processed by the server in accordance with the service jq 
request message. 

In order to send a service request message to a server, the 
operating system must first select an appropriate server. 
Typically, the operating system selects a server according to 
criteria that may be apphcd through the use of a metric. By 55 
way of example, a commonly used metric distributes the 
service request message to a server closest in distance to the 
client. Accordingly, the operating system may direct traffic 
to a server according to a specified metric. 

Although various metrics exist for allocating resources 60 
within a network, these metrics do not adjust assignment of 
server requests in accordance with the load capacity of each 
server. By way of example, each server may have different 
processing capabilities. As yet another example, a server 
may be entirely unavailable. Metrics traditionally used in 65 
distributed systems do not adjust assignment of requests 
according to such situations. As a result, these metrics 
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cannot adequately maximize the throughput of a distributed 
system having servers with heterogeneous load capabiUiies. 
It would be desirable if a metric were provided such that 
server requests could be distributed in accordance with the 
load capacity of each server. 

Every host and router on the Internet has an IP address. 
The Domain Name System (DNS) is often used to map host 
names to these IP addresses. By way of example, a client 
typically sends a DNS query to a DNS server which includes 
a host name and an indication that an IP address is requested. 
The DNS server then returns an IP address associated with 
the host name. It would therefore be desirable if a DNS 
server were designed to accomplish load distribution com- 
patible with the Interact and DNS. 

In addition to ensuring adequate load distribution, it is 
necessary to accommodate for shifts of information. Infor- 
mation is commonly transferred from one web server to 
another web server on the Internet. Typically, a web server 
will redirect a chent to a new location due to this physical 
shift of information between web servers. However, the host 
of the client is typically not taken into consideration during 
selection of the web server in providing this redirect. In 
many instances it would be desirable if the information 
could be selectively varied according to the particular client 
requesting the information. It would therefore be desirable if 
the host of the client were considered during selection of an 
appropriate web server in order to provide such a redirect. 
Moreover, it would be beneficial if the client could easily 
identify the server to which the server request is redirected. 

In view of the above, a system and method for redirecting 
traflSc in a distributed system according to individual server 
capabilities would be desirable. Additionally, it would be 
beneficial if such a system distributed traffic in proportion to 
individual server portion metrics. 

SUMMARY OF THE INVENTION 

The present invention provides a method and apparatus 
for distributing server requests among a plurality of servers 
in a distributed system. This is accomplished through assign- 
ing a portion metric to each server. In this manner, the 
portion metric allows capabilities of each candidate server to 
be taken into consideration during distribution of each server 
request. 

In accordance with one aspect of the present invention, 
each server request is distributed in accordance with the 
Domain Name System (DNS). Configuration is performed 
in several steps. First, a Domain Name System host name is 
associated with a plurality of servers, each one of the servers 
having a unique IP address. By way of example, a DNS table 
on a DNS server typically includes a plurality of entries, 
each of the entries containing an IP address-host name 
association. Second, a portion metric is then assigned to 
each one of the plurality of servers. The portion metric 
designates a portion of total server requests to be allocated 
to the one of the plurality of servers. As a resuh, each web 
server is assigned a portion metric. Each portion metric 
designates a portion of total server requests to be allocated 
to the corresponding server. Once configuration is 
completed, server requests may be allocated. 

Each server request is separately allocated to a selected 
web server. The server request may include a DNS host 
name query received from a chent. By way of example, a 
DNS host name query may include a host name to be looked 
up and an indication that an IP address is requested, A 
plurahty of IP addresses associated with the host name may 
then be obtained, each one of the IP addresses being asso- 
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ciated with one of the plurality of servers. A total number of tion is accepted, A total number of server requests processed 

server requests processed by the plurality of servers is by the plurality of servers is incremented. In addition, a 

incremented. In addition, a number of server requests dis- number of server requests distributed to each of the servers 

tributed to each one of the plurality of servers is maintained. is maintained. One of the plurality of servers is then selected 

One of the plurality of servers is selected using the portion 5 using the portion metric assigned to each of the servers, the 

metric assigned to each one of the plurality of servers, the number of server requests distributed to each of the servers, 

number of server requests distributed to each one of the ^jjj jqi^i number of server requests. An HTTP code 

pluraUty of servers, and the total number of server requests. ^^^^^^^ ^ ^^^^ ^ t^e client. Therefore, information 

An IP address associated with the selected server is then -^^ ^ ^^^^^ selectively varied according to 

provided to the client from which the server request was particular cUent requesting the information, 

obtamed. Accordingly, when each server request is received, ^ , - . . - 

one of the plurality of IP addresses is determined and ^h^ advantages of assigning a portion metric to a set of 

provided to the client. In this manner, traffic sent to geo- '"l^ distnbuUng a senr.ce request m accordance with 

graphically distributed web serves in multiple networks P?'^'°" =«;^'gnnienLs are numerous The present 

may be distributed and monitored. Moreover, the present „ "^^^ f"'. aUow capabiht.es of each candi- 

invention may be and is preferably geographically distant " date server to be taken into consideration during distnbutton 

from the web servers. Thus, connections distributed to each ^^^"^ ^'^^ ^^'l"^^- way of example, portion metric 

web server may be monitored and load averages of each web assignments may be used to maximize the throughput of a 

server may be subsequently queried. distributed system having servers with diverse processor 

•^^ L L f. Speeds. Similarly, portion metrics may be reassigned Upon a 

In accordance with another aspect 01 the present ^n f. *• *i- * • 1 ui r 

, . J • -.1. determmation that a server is unavailable, 
invention, the portion metric is used in combination with 

other metrics. A portion metric is assigned to each one of a BRIEF DESCRIPTION OF THE DRAWINGS 
plurality of servers. The portion metric for each one of the 

plurality of servers is added to obtain a total portion metric, FIG. 1 is a block diagram illustrating an exemplary 

A number of server requests distributed to each one of the 25 distributed system in which the portion metric of the present 

plurality of servers is maintained. In addition, a total number invention may be implemented. 

of server requests processed by the plurality of servers is FIG. 2 is a block diagram illustrating a distributed system 

incremented. A server request is then received. A server in which the distributed director of the present invention is 

request percentage is computed for each one of the plurality implemented. 

of servers. The server request percentage for each one of the 3^ pjc. 3 is a flow diagram illustrating a method for oper- 

plurality of servers is a product of the number of server ^^^^ t^e distributed director in DNS mode according to one 

requests distributed to the one of the plurality of servers and embodiment of the present invention, 

the total portion metric divided by the total number of server ™^ a a j- n * ** *u j r 1 

^ - J, jj.,- t 11. J FlG.4isa flow diagram illustrating a method for calcu- 

requests received. In addition, a metric value is calculated , . . ^ ^ 1 1* 

. 1-11, . • 1 r latine the server request percentage for one of the plurality 

for each one of the plurality of servers. The metric value for ,c ^ j- * u j- * r *u 

r.L 1 f • J c ju .u ^ •'^ of servers accordmg to an embodiment or the present 

oneof the plurality of servers is denned by the server request . ^. ^ ^ 

. c .J r 1 1* r J *u invention, 
percentage for the one of the plurabty of servers and the 

portion metric for the one of the plurality of servers. The FIG. 5 is a flow diagram illustrating a method for deter- 

metric values for the plurality of servers are compared to n^ining the metric value for one of the plurality of servers 

obtain a selected server. 40 according to one embodiment of the present mvenlion. 

The selected server may include a set of servers. The set FIG- ^ is a flow diagram illustrating a method for oper- 

of servers may include those servers having an identical ating the distributed director in HTTP redirect mode accord - 

metric value. Alternatively, a tolerance range may be con- to one embodiment of the invention, 

figured for the portion metric. The tolerance range may then DETAILED DESCRIFHON OF THE 

be used m companng the me nc value for each of the 45 PREFERRED EMBODIMENTS 
servers. Thus, a set of the plurahty of servers havmg a metric 

value within the tolerance range may be obtained. An The present invention provides a method and system for 

alternate metric may then be applied to the set of servers to redirecting traffic among a number of servers. This is accom- 

obtain a single selected server. By way of example, the plished through ascertaining the host for the client, deter- 

alternate metric may include a distance metric. The selected 50 mining a plurality of servers associated with the host, and 

server is then provided to the client. selecting one of the plurality of servers. The selection may 

In accordance with yet another aspect of the invention, be based upon defined portion metrics as well as other 

load imbalance may be detected. As described above, a set metrics. Various embodiments of the invention are possible, 

of the plurality of servers is obtained upon application of the By way of example, the present invention may be imple- 

portion metric. When the set of the plurality of servers 55 mentcd in accordance with the Domain Name System as 

includes only one server, this may indicate the presence of well as the HTTP protocol. In addition, the invention can be 

load imbalance. By way of example, if there is a large implemented in numerous ways, including as a method, an 

disparity in the amount of server requests distributed to each apparatus such as a switching element (e.g., router or 

server, there will be one server that has received the least brouter), or a computer readable medium. Several embodi- 

amount of server requests, and this server will therefore be 60 ments of the invention are discussed below, 

selected. A report may be generated indicating the presence Referring first to FIG. 1, a block diagram illustrating an 

of this load imbalance in the distributed system. Moreover, exemplary distributed system in which the portion metric of 

the detection of load imbalance permits reallocation of the present invention may be implemented is presented. A 

servers as well as the reconfiguration of portion metrics and distributed system 10 wiU typicaQy include one or more 

tolerance ranges. 65 clients 12 and a plurality of servers. As illustrated, the 

In accordance with a further aspect of the invention, a plurality of servers includes a first server 14, a second server 

portion metric is assigned to each server. An HTTP connec- 16, and a third server 18, Typically, a client 12 will send a 
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request for a service which will be distributed to one of the 
plurality of servers. To distribute each such service request, 
one or more metrics may be used in the selection process. In 
a commonly used metric, the service request is distributed to 
the server closest in distance to the client. The first server 14 5 
is a first distance 20 from the client 12, the second server 16 
is a second distance 22 from the client 12, and the third 
server 18 is a third distance 24 from the client 12. As shown, 
the second distance 22 indicates that the second server 16 is 
closest in distance to the client 12. Accordingly, the service -jq 
request would be distributed to the second server 16 if such 
a metric were applied. However, in circumstances where 
each one of the plurality of servers 14, 16, 18 is equidistant 
from the client 12, another metric may be used as a "tie 
breaker." Moreover, the distance between the client and 
selected server may be deemed insignificant in comparison 
to factors such as the processor speed of each available 
server. In such instances, an alternate metric may be desir- 
able. 

Referring next to FIG. 2, a block diagram of a distributed 20 
system in which the distributed director of the present 
invention according to one embodiment is presented. A 
distributed system 70 in which the present invention may be 
implemented is illustrated. According to the present 
invention, a distributed director 72 may be provided for each 25 
host. As shown, the distributed director 72 is provided for an 
Internet host 74. As will be described, a cHent 76 connecting 
to the Internet 74 may wish to connect to a web server 
located in another geographical location. By way of 
example, the San Jose web server 78 and Paris web server 30 
80 are shown. The distributed director 72 determines a host 
associated with the client 76 (e.g., patents.com). The dis- 
tributed director 72 may be configured to be the primary 
DNS server for a particular host. Thus, once the host is 
determined, the IP addresses associated with this host may 35 
easily be obtained. In this manner, available web servers 
associated with these IP addresses are made available for 
selection. 

In response to a server request, the distributed director 72 
utilizes one or more metrics to select one of these web 40 
servers. Metrics utilized by the distributed director 72 may 
include a variety of metrics, including the distance metric. 
An information gathering block 82 may be utilized to gather 
metric information such as the distance to each server. The 
information gathering block 82 may include a router and 45 
therefore must have access to all routing tables relating to 
the geographically distributed Web servers. By way of 
example, the distributed director 72 may query the infor- 
mation gathering block 82 for distance metrics between the 
distributed servers 78, 80 and the client 76. According to one 50 
embodiment, the information gathering block 82 may 
include a Direct Response Protocol (DRP) agent, available 
from Cisco Technology, Inc., located in San Jose, Calif. In 
addition, the distributed director 72 provides a portion 
metric which allows capabilities of each candidate server to 55 
be taken into consideration during distribution of each server 
request. The distributed director 72 may therefore direct 
clients to an appropriate server that is also closest in dis- 
tance. In this manner, the distributed director 72 provides 
dynamic, transparent, and scalable Internet traffic load dis- 60 
tribution between multiple geographically dispersed servers. 

Referring next to FIG. 3, a flow diagram illustrating a 
method for operating the distributed director in DNS mode 
according to one embodiment of the invention is shown. As 
shown, a service request is distributed among a plurality of 65 
servers 26. First, at step 28, the distributed director is 
configured. Configuration may include associating each host 
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with a plurality of IP addresses corresponding to a plurality 
of web servers in the distributed system, assigning a portion 
metric to each server, assigning other metrics to each server, 
and specifying a tolerance range for application of each 
specified metric. 

A portion metric PORTION_METRIC[X] is assigned to 
each one of the plurality of servers, where X«l to the total 
number of servers N. Each portion metric designates a 
portion of total server requests to be allocated to the corre- 
sponding one of the plurality of servers. For example, 
portion metrics 20%, 32%, and 60% may be assigned to 
three servers, respectively. The portion metrics do not nec- 
essarily need to total a value of one hundred. Thus, if 
percentages have been overassigned as in the previous 
example, then the percentage of server requests assigned to 
each server will automatically be scaled down by each 
server's portion metric such that they total 100% of requests 
and retain their relative configured portion of requests. 

According to one embodiment, a portion metric is 
assigned to each server in the distributed system using the 
DNS host name. For example, "HOST SERVERl POR- 
TION 32" could be used to indicate that the portion of server 
requests for the DNS host "SERVERl" is 32. These portion 
metrics may then be used to determine a percentage of server 
requests to send to each server Moreover, a default portion 
metric may be provided. By way of example, the default 
portion metric may be zero. As a result, servers with a higher 
portion metric will receive a larger number of server 
requests than servers with a lower portion metric. This 
allows a network administrator to adjust server load distri- 
bution across heterogeneous distributed servers, resuUing in 
improved performance as seen by clients. 

The portion metric may be used in combination with other 
metrics or alone to distribute server requests in a network or 
distributed system to select an appropriate server for each 
client. Similarly, any of these metrics may be turned off or 
on to add or remove metrics which are to be considered. For 
example, a command "PORTION X METRIC2 ..." may be 
provided to add the portion metric and a second metric, 
"METRIC2". Similarly, a command "NO METRIC3" could 
be used to turn oS a third metric, "METRIC3". When the 
portion metric is used in combination with other metrics, the 
order in which these metrics are considered is specified. By 
way of example, the first metric specified in the command 
line may be considered first, followed by each following 
metric specified. 

In addition to specifying a portion metric as well as other 
metrics for each server, a tolerance range may be configured. 
The tolerance range supphes a range used to determine, 
relative to a lowest or highest value, whether any distributed 
servers should be equally preferred for a given client. For 
example, assume there are three servers: SERVERl, 
SERVER2 and SERVER3 associated with values 100, 119 
and 125, respectively. If the tolerance value is set to 20 
percent, SERVERl (associated with value 100) and 
SERVER2 (associated with value 119) would be equally 
preferred since 119 is within the 20 percent tolerance range. 
SERVER3 (associated with value 125) would be eliminated 
from consideration. 

The present invention may be configured to function in a 
DNS mode as well as in HTTP redirect mode. Additional 
DNS resource records must be added to the primary 
domain's primary DNS server to identify the distributed 
director as the authoritative name server for a given host in 
DNS mode, or as the actual Web server requested by the end 
user in HTTP redirect mode. Moreover, DNS resource 
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records used by the distributed director may be configured in metrics may be assigned a weight during configuration 

an external server or, alternatively, in the distributed direc- indicating a metric priority. If multiple metrics have the 

tor. same metric priority, the metrics may be added to obtain a 

Next, at step 29, initialization of variables is performed. composite metric. For example, if two metrics have the same 
During this step, a variable storing a total number of server 5 metric value, the metric values may each be multiplied by 

requests processed by the plurality of servers, TOTAL_ Iheir respective weight values (if specified) and then added 

REQUESTS, is initialized to a constant. In addition, a together to form the composite metric for the corresponding 

variable for each one of the plurality of servers storing a server. According to one embodiment, the default weight 

number of server requests distributed to each one of the values are set to 1. 

plurality of servers, NUMBER_SERVER_REQUESTS[X] The metric value for each one of the plurality of servers 

are each initialized to a constant. According to a preferred are then compared to choose one or more "selected" servers 

embodiment, the variables are initialized to zero. However, (e.g., a server having a lowest or highest metric value) at step 

another constant may be used to avoid the potential problem 40. In the embodiment described below, it is assumed that 

of dividing by zero. the server having the lowest metric value is selected. 

Once the distributed director is configured and initializa- Moreover, a set of the plurality of servers having metric 

tion is performed, a server request may then be received at values within the tolerance range may be "selected", 
step 30. By way of example, the server request may include If multiple servers are determined to have been "selected" 

a DNS host name query that is received from a client. A DNS at step 42 (the metric values associated with the selected 

host name query typically includes a host name to be looked servers are within the tolerance range), an alternate metric 

up and an indication that an IP address is requested. may be used as a "tie breaker" at step 44. By way of 

After the server request is received, the portion metrics example, the server closest in distance to the client may be 

may be utilized as well as other metrics to select an selected. As yet another example, the next metric may be 

appropriate server in response to each server request. used to select the "best" server. 

According to one embodiment, each server request is dis- Next, at step 46, the IP address associated with the 

tributed in accordance with the Domain Name System selected server is provided to the client from which the 

(DNS). At step 32, a plurality of IP addresses associated with server request was obtained. By way of example, the IP 

the queried host name are obtained. Each one of the plurality address may be returned to the client's local DNS. 

of IP addresses are associated with one of a plurality of Therefore, the server request is distributed using the portion 

servers in the distributed system. In this manner, a set of metric assigned to each one of the plurality of servers, the 

servers are obtained firom which one is to be selected. At step number of server requests distributed to each one of the 

34, the portion metrics for each one of the plurahty of plurality of servers, and the total number of server requests 

servers are added to obtain a total portion metric, TOTAL_ as described above. As a result, the distributed director 

PORTION. effectively functions as a DNS server. In this manner, a 
In order to distribute the server request to an appropriate 35 server request may be distributed in accordance with the 

server, metric values for each one of the plurality of servers DNS. 

are computed. A metric value for a selected server X is The tolerance range specified for each port ultimately 

computed using the number of server requests distributed to determines the frequency that the portion metric is used to 

the selected server, the portion metric for the selected server, allocate server requests. By way of example, the first metric 
the total portion metric, and the total number of server 40 to be applied may be the portion metric and the second 

requests processed by the distributed system where metric to be applied may be the distance metric. If the 

METRIC_VALUE[X]«SERVER_REQUEST_ tolerance range for the portion metric is set to 20 perce nt, all 

PERCENTAGE[X]-PORTION_METRIC[X]. At step 36, a servers having a metric value within 20 percent of the 

server request percentage is calculated for each one of the optimum metric value (e.g., lowest or highest metric value) 
plurality of servers. For each server, the server request 45 will be considered to be equally desirable. In this case, the 

percentage is calculated using the number of server requests second distance metric is applied as a "tie breaker", 

distributed to the server, the total portion metric, and the Accordingly, the second distance metric will be used in all 

total number of server requests received where SERVER_ instances where two or more servers are within the tolerance 

REQUEST_PERCENTAGE[X]=NUMBER_SERVER_ range of an optimum metric value. As a result, the portion 
REQUESTS[X]*TOTAL_PORTION/TOTAL_ 50 metric is used only in instances where only one server falls 

REQUESTS. Thus, the server request percentage is a within this tolerance range. In other words, the portion 

percentage of total requests distributed to a server X which metric will ultimately be used only to correct load imbal- 

is normalized to the portion metric assignments. Next, at ances according to the configured tolerance range, while a 

step 38, a metric value is determined for each one of the second metric (e.g., distance metric) will be applied when , 
plurality of servers. For each server, the metric value is 55 there is no load imbalance. 

determined using the server request percentage for the As described above, the frequency that the portion metric 

server, the portion metric for the server, and the total number is applied may provide an indication of load imbalance, 

of server requests received. According to one embodiment. Monitoring the application of the portion metric will there- 

the metric value is obtained by subtracting the portion metric fore indicate whether the load apportionment among the 

from the server request percentage, METRIC_VALUE[X]= servers is appropriate. Accordingly, a counter may be incre- 

SERVER„REQUEST_PERCENTAGE[X]-PORTION_ mented each lime the portion metric is applied. This counter 

METRIC[X]. However, a constant such as TOTAL_ value may then be used to correct any load imbalance 

REQUESTS may be added to this value to ensure that the present in the distributed system. 

METRIC_ VALUE [X] is a positive number. since the application of the portion metric (e.g., one 

If muhiple metrics have been specified for each server, the 65 server is selected according to the portion metric) will 

metrics may be combined to form a single metric value prior indicate the presence of a load imbalance within the distrib- 

to server selection. In addition, each metric or selected utedsystem, a report may be generated each time the portion 
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metric is applied. The report may include information 
designed to assist in correcting load imbalance. By way of 
example, the report may include the number of server 
requests distributed to each server. Thus, the report will 
furnish the necessary information to allow a system admin- 
istrator to reconfigure selected portion metrics and tolerance 
ranges, as well as modify the allocation of the servers. By 
way of example, the geographical distribution of the servers 
may be modified. As yet another example, additional servers 
may be added to a particular geographical location. 

Once the server request has been distributed to a server, 
all variables are updated to reflect the receipt of a new server 
request as well as the allocation of the new server request to 
the selected server. At step 48, the number of server requests 
distributed to the selected server, NUMBER_SERVER_ 
REQUESTS[SELECTED_SERVER] is increased. In this 
manner, the number of server requests distributed to each 
one of the plurality of servers is maintained. According to 
one embodiment, the number of server requests distributed 
to the selected server is increased in response to step 46. 
Similarly, at step 50, the total number of server requests 
processed by the plurality of servers, TOTAL_REQUESTS, 
is incremented. 

The above steps may be modified or performed in an 
alternate order without departing from the scope of the 
invention. For example, according to a presently preferred 
embodiment, the total number of server requests processed 
by the plurality of servers is incremented in response to step 
46. Accordingly, the total number of server requests is 
incremented by one. Alternatively, the total number of server 
requests may be computed as a sum of the number of server 
requests distributed to each one of the plurality of servers. 
As another example, the total number of server requests may 
be incremented in response to step 29. This would eliminate 
the possibility of a "divide by zero" error since the total 
number of server requests would be incremented prior to 
step 36. 

Referring now to FIG. 4, a flow diagram illustrating a 
method for calculating the server request percentage for 
each of the plurality of servers 36 shown in FIG. 2 according 
to an embodiment of the present invention is shown. At step 
52, a first one of the plurality of servers is selected. For 
example, if the server request percentage for each of the 
plurality of servers is stored in an array, each one of the 
plurality of servers may be selected according to an index, 
X. As shown, the first index in such an array may be one. At 
step 54, the number of server requests distributed to the 
selected one of the plurality of servers and the total portion 
metric are multiplied to obtain a product. Next, at step 56, 
the product is divided by the total number of server requests 
received to obtain the server request percentage for the 
selected one of the plurality of servers. According to a 
preferred embodiment, in order to avoid dividing by zero, it 
is determined if the total number of server requests is equal 
to zero. If the total number of server requests is equal to zero, 
the server request percentage for the one of the plurahty of 
servers is initialized to zero. Next, at step 58, it is determined 
whether the server request percentage for each of the plu- 
rality of servers has been calculated. For example, if the 
index, X, is equal to the number of servers N, all necessary 
calculations have been performed. However, if the index, X, 
is less than the number of servers, the server request 
percentage must be calculated for the remaining servers. At 
step 58, if the server request percentage has not been 
calculated for each of the servers, the index, X, is incre- 
mented at step 60, and the method continues at step 54. 

Referring now to FIG. 5, a flow diagram illustrating a 
method for determining the metric value for each of the 
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plurality of servers 38 shown in FIG, 2 according to one 
embodiment of the invention is illustrated. At step 62, a first 
one of the plurality of servers is selected. As described 
above, a server may be selected according to an index, X. At 

5 step 64, the portion metric associated with the selected one 
of the plurality of senders, PORTION_METRICpC], is 
subtracted from the server request percentage for the 
selected one of the plurality of servers, SERVER_ 
REQUEST_PERCENTAGE[X], to obtain a metric value 

10 for the selected one of the plurality of servers, METRIC_ 
VALUE[X]. Thus, METRIC_VALUE[X]=SERVER_ 
REQUEST_PERCENTAGE[X]-PORTION_MErRIC 
[X]). In addition, as described above, a constant such as 
TOTAL_REQUESTS may be added to this value to ensure 

15 that the METRIC_VALUE[X] is a positive number. Next, at 
step 66, it is determined whether the metric value for each 
of the plurality of servers has been calculated. For example, 
if the index, X, is equal to the number of servers N, all 
necessary calculations have been performed. However, if the 

20 index, X, is less than the number of servers, the metric value 
must be calculated for the remaining servers. At step 66, if 
the metric value has not been calculated for each of the 
servers, the index, X, is incremented at step 68, and the 
method loops at step 64 until the metric value for each server 

25 has been calculated. 

According to another embodiment of the invention, the 
distributed director operates in HTTP redirect mode in 
accordance with the HTTP protocol. Referring now to FIG. 
6, a method for operating the distributed director in HTTP 

30 redirect mode 84 is presented. The distributed director is 
configured at step 86. Configuration may include associating 
each host with a plurality of IP addresses corresponding to 
a plurality of web servers in the distributed system, assign- 
ing a portion metric to each server, assigning other metrics 

35 to each server, and specifying a tolerance range for appli- 
cation of each specified metric as described above in step 28. 
Next, at step 88, an HTTP connection is accepted from a 
client. Thus, the distributed director operates as though it 
were a web server. The distributed director has an IP address 

40 as well as a different IP address used by a web redirector. As 
a result, the client may communicate with the web redirector 
by connecting to the web redirector' s IP address. The web 
redirector may then redirect connections through the use of 
virtual web servers. By way of example, a different host (and 

45 IP address) is associated with each set of virtual servers. The 
client is then connected to one of a set of virtual web servers 
associated with the distributed director at step 89. The 
client's IP address is then obtained at step 90 since it is later 
used to send an HTTP redirect. A host name (e.g., 

50 www.cisco.com) associated with the IP address of the virtual 
web server connected to is then determined at step 92. This 
may be performed via a DNS server, as described above. 

Once the client is connected to the virtual web server, it 
is desirable to obtain a set of IP addresses from which to 

55 select an IP address and send an HTTP code redirect. One 
problem which may occur when a set of IP addresses 
associated with the host name of the virtual server is 
obtained is that the IP address of the virtual web server 
connected to may ultimately be selected. Since it would be 

60 extremely undesirable to select the virtual web server that 
the client is connected to, a new host name different from the 
host name of the virtual server connected to may be con- 
structed. At step 93, a new host name associated with the 
host name determined in step 92 is constructed. By way of 

65 example, a string may be appended to the host name of the 
virtual server that the cfient is connected to. A set of IP 
addresses of real web servers associated with the new host 
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name are then obtained at step 94. By way of example, the 
real web servers may be associated with the new host name 
in a DNS server. In this manner, the IP address of the virtual 
web server connected to may be excluded, or filtered, from 
selection. 

Once the set of IP addresses are obtained, an IP address 
may be selected. As shown in FIG. 6, one of the set of IP 
addresses may be selected according to steps 32 through 44 
described above. At that time, the client's IP address may be 
used in the selection process depending on the metric(s) 
applied. By way of example, the distance metric may use the 
client's IP address to ascertain the distance between a server 
and the client. The distributed director then sends to the 
client the HTTP status code "302 temporarily moved" with 
a URL for the selected server associated with the selected IP 
address at step 96. The cUenl may then transparently connect 
to the selected web server specified by this URL. System 
variables may then be updated to reflect the server selection 
according to steps 48 through 50 described above. In this 
manner, a client may communicate with the server to which 
it has been redirected transparently through the distributed 
director web server. Accordingly, information provided to a 
chent may be selectively varied according to the particular 
client requesting the information. 

The present invention redirects chents to an appropriate 
server according to one or more metrics. This eliminates the 
need for end users to choose a server from a list of possible 
sites. In addition, since the distributed director localizes 
traffic to Internet servers, end users benefit from improved 
access performance and reduced transmission costs. 

The invention can also be embodied as computer readable 
code on a computer readable medium. The computer read- 
able medium is any data storage device that can store data 
which can thereafter be read by a computer system. 
Examples of the computer readable medium include read- 
only memory, random-access memory, CD-ROMs, mag- 
netic tape, and optical data storage devices. The computer 
readable medium can also be distributed over a distributed 
system between coupled computer systems so that the 
computer readable code is stored and executed in a distrib- 
uted fashion. 

Although illustrative embodiments and applications of 
this invention are shown and described herein, many varia- 
tions and modifications are possible which remain within the 
concept, scope, and spirit of the invention, and these varia- 
tions would become clear to those of ordinary skill in the art 
after perusal of this application. For instance, the present 
invention is described as being implemented and capable of 
operating in DNS mode or HTTP redirect mode. However, 
it should be understood that the invention is not limited to 
such implementations, but instead would equally apply 
regardless of the context and system in which it is imple- 
mented. Thus, broadly speaking, the operations described 
above may be used to distribute traffic in a distributed 
system based on predefined server metrics. In addition, the 
above described invention may be stored on a disk drive, a 
hard drive, a floppy disk, a server computer, or a remotely 
networked computer. Accordingly, the present embodiments 
are to be considered as illustrative and not restrictive, and 
the invention is not to be limited to the details given herein, 
but may be modified within the scope and equivalents of the 
appended claims. 

What is claimed is: 

1. A method for distributing a service request, the method 
comprising: 

assigning a portion metric to each one of a plurality of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 
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accepting an HTTP connection; 

incrementing a total number of server requests processed 
by the plurality of servers; 

maintaining a number of server requests distributed to 
each one of the plurality of servers; 

selecting one of the plurality of servers using the portion 
metric assigned to each one of the plurality of servers, 
the number of server requests distributed to each one of 
the plurahty of servers, and the total number of server 
requests; and 

sending an HTTP code redirect. 

2. A method for distributing a service request, the method 
comprising: 

determining a metric value associated with each one of a 
plurahty of servers; 

configuring a tolerance range used in comparing the 
metric value for each one of the plurality of servers; 

comparing the metric value for each one of the plurality 
of servers to obtain a set of the pliu^afity of servers, each 
one of the set of the plurality of servers having a metric 
value within the tolerance range; 

applying an ahemate metric to the set of the plurality of 
servers to obtain a selected server when the set of the 
plurahty of servers includes more than one server, and 
otherwise establishing that the selected server is the 
obtained set of the plurahty of servers; and 

providing the selected server. 

3. A method for distributing a service request, the method 
comprising: 

associating each one of a plurality of servers with a 
Domain Name System host name, each one of the 
plurahty of servers having a unique IP address; 

assigning a portion metric to each one of the plurahty of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 

receiving a server request, the server request being a 
Domain Name System host name query including the 
Domain Name System host name; 

incrementing a total number of server requests processed 
by the plurahty of servers; 

maintaining a number of server requests distributed to 
each one of the plurahty of servers; 

selecting one of the plurahty of servers using the portion 
metric assigned to each one of the plurality of servers, 
the number of server requests distributed to each one of 
the plurahty of servers, and the total number of server 
requests; and 

providing an IP address associated with the selected one 
of the plurality of servers. 

4. The method as recited in claim 3, wherein selecting one 
of the plurahty of servers further includes: 

computing a metric value for each one of the plurality of 
servers using the number of server requests distributed 
to each one of the plurality of servers, the portion 
metric assigned to each one of the plurality of servers, 
and the total number of server requests processed; and 

comparing the metric value for each one of the plurality 
of servers to determine a selected server. 

5. The method as recited in claim 4, wherein maintaining 
the number of server requests distributed to each one of the 
plurality of servers further includes: 

increasing the number of server requests distributed for 
the selected server. 
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6. The method as recited in claim 4, wherein the selected for the one of the plurality of servers, the total portion 
server has a lowest metric value. metric, and the total number of server requests 

7. The method as recited in claim 4, wherein the selected received. 

server has a highest metric value. 17. The method as recited in claim 16, wherein calculating 

8. The method as recited in claim 5, wherein increasing 5 the server request percentage for the one of the plurality of 
the number of server requests distributed is performed in servers includes: 

response to distributing the server request, multiplying the number of server requests distributed to 

9. The method as recited in claim 3, wherein incrementing the one of the plurality of servers and the total portion 
the total number of server requests further includes; metric to obtain a product; and 

incrementing the total number of server requests in 1° dividing the product by the total number of server 

response to receiving the server request. requests received to obtain a server request percentage 

10. The method as recited in claim 3, wherein increment- for the one of the plurality of servers. 

ing the total number of server requests further includes: 18. The method as recited in claim 15, wherein determin- 

incrementing the total number of server requests in ing the metric value for each one of the pluraHty of servers 

response to distributing the server request. further includes: 

11. The method as recited in claim 3, further including: determining a metric value for one of the plurality of 

f ' i 4 A 1 ' If servers using the server request percentage for the one 

if more than one server is selected, applying an alternate ,,t-, . . ^ , 

o «f «i„.oi:t,. LL^Z «kt«;« of the plurahty of servers and the portion metric for the 

metric to a set of the plurauty oi servers to obtam a / . , , ■ i- 

selected server °^ plurahty of servers. 

12. The method as recited in claim 4, fiirther including: . ^ i'^^f^ ^ ^^'f^^^ '^^'f'' 

^ . , , . . ° ing the metnc value for the one of the plurahty of servers 

configuring a tolerance range used in comparmg the further includes: 

metric value for each one of the plurality of servers. u* *u * • c *u f*u i i * 

-.•i i^u - subtractmg the portion metric for the one of the plurality 

13. The method as recited m claun 12, wherem comparmg ^ c * * *u 

, , r L f 1 1. r of servers from the server request percentage for the 

the metnc value for each one of the plurahty of servers ^5 ^ne of the plurality of servers to obtain a metric value 

further includes: pl^^^j.^^ ^^^^^ 

defining a set of the plurahty of servers having metric 20. The method as recited in claim 18, wherein determin- 

values withm the tolerance range and mcludmg the -^^g ^^^^^ ^^^^^ the one of the plurahty of servers 

selected one of the plurahty of servers; and further includes: 
applying an alternate metric to the set of the plurality of 30 subtracting the server request percentage for the one of 

servers to obtain a selected one of the plurality of the plurahty of servers from the portion metric for one 

servers when the set of the plurahty of servers includes ^f the plurality of servers to obtain a metric value for 

more than one server, and otherwise estabhshing that the one of the plurality of servers, 

the selected one of the pluraHty of servers is the set of 21. The method as recited in claim 17, wherein dividing 

the plurahty of servers. 35 product by the total number of server requests further 

14. The method as recited in claim 12, wherein comparing includes: 

the metric value for each one of the plurality of servers ^^^^^^ ^^^^ ^^q^^^t^ ^q^^j t^ ^^^^^ 

further includes: initiahzing the server request percentage for the one of 

obtaining a set of the plurahty of servers, each one of the ^he plurahty of servers to a value of zero; and 

set of the plurahty of servers having a metric value ^^^^^ ^^^^^^ ^^^^ requests is greater than zero, 
within the tolerance range; and dividing the product by the total number of server 
applying an alternate metric to the set of the plurality of requests received to obtain the server request percent- 
servers to obtain a selected one of the plurality of age for the one of the plurality of servers, 
servers when the set of the plurahty of servers includes 22. The method as recited in claim 15, wherein calculating 
more than one server, and otherwise estabhshing that the server request percentage for each one of the plurahty of 
the selected one of the plurahty of servers is the set of servers further includes: 

the plurahty of servers. assigning a value of zero to the server request percentage 

15. The method as recited in claim 4, wherein computing fo^ ^^^h one of the plurality of servers when the total 
the metric value for each one of the plurality of servers number of server requests is equal to zero. 

further includes: 23. The method as recited in claim 3, further including: 

adding the portion metric for each one of the plurality of initializing the total number of server requests and the 

servers to obtain a total portion metric; number of server requests distributed to each one of the 

calculating a server request percentage for each one of the plurality of servers to a constant. 

plurahty of servers using the number of server requests 55 24. The method as recited in claim 23, wherein the 

distributed to each one of the plurality of servers, the constant is zero. 

total portion metric, and the total number of server 25. The method as recited in claim 3, wherein increment- 
requests received; and ing the total number of server requests further includes: 
determining a metric value for each one of the plurahty of adding the number of server requests distributed to each 
servers using the server request percentage distributed go one of the plurality of servers to obtain the total number 
to each one of the plurahty of servers and the portion of server requests. 

metric assigned to each one of the plurality of servers. 26. A method for distributing a service request, the 

16. The method as recited in claim 15, wherein calculating method comprising: 

the server request percentage for each one of the plurality of assigning a portion metric to each one of a plurahty of 

servers further includes: 65 servers, the portion metric designating a portion of total 

calculating a server request percentage for one of the server requests to be allocated to the one of the plurality 

plurality of servers using the number of server requests of servers; 
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adding the portion metric for each one of the plurality of 
servers to obtain a total portion metric; 

maintaining a number of server requests distributed to 
each one of the plurality of servers; 

receiving a server request; ^ 

incrementing a total number of server requests processed 
by the plurality of servers; 

computing a server request percentage for each one of the 
plurahty of servers, the server request percentage for 
one of the plurality of servers being a product of the 
number of server requests distributed to the one of the 
plurahty of servers and the total portion metric divided 
by the total number of server requests received; 

calculating a metric value for each one of the plurahty oi 
servers, the metric value for one of the plurality of 
servers being defined by the server request percentage 
for the one of the plurality of servers and the portion 
metric for the one of the plurality of servers; 

comparing the metric value for each one of the plurality 
of servers to obtain a selected server; 

applying an alternate metric to the selected server when 
the selected server includes more than one server; and 

providing the selected server. 

27. The method as recited in claim 26, wherein the 
alternate metric is a distance metric. 

28. The method as recited in claim 26, further including: 
turning a plurality of metrics on, the plurality of metrics 

including the alternate metric. 

29. The method as recited in claim 26, further including: 
specifying an order in which each of the plurality of 

metrics is considered. 

30. The method as recited in claim 26, further including: 
specifying a priority for each one of the plurahty of 

metrics. 

31. The method as recited in claim 30, wherein specifying 
a priority further includes: 

assigning a weight indicating a metric priority to one of 
the plurality of metrics. 

32. A method for distributing a service request, the 
method comprising: 

assigning a portion metric to each one of a plurahty of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 

adding the portion metric for each one of the plurality of 

servers to obtain a total portion metric; 
maintaining a number of server requests distributed to 

each one of the plurality of servers; 
receiving a server request; 

incrementing a total number of server requests processed 
by the plurahty of servers; 

computing a server request percentage for each one of the 
plurality of servers, the server request percentage for 
one of the plurahty of servers being a product of the 
number of server requests distributed to the one of the 
plurahty of servers and the total portion metric divided 
by the total number of server requests received; 

calculating a metric value for each one of the plurahty of 
servers, the metric value for one of the plurality of 
servers being defined by the server request percentage 
for the one of the plurahty of servers and the portion 
metric for the one of the plurality of servers; 

configuring a tolerance range used in comparing the 
metric value for each one of the plurality of servers; 
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comparing the metric value for each one of the plurality 
of servers to obtain a set of the plurality of servers, each 
one of the set of the plurality of servers having a metric 
value within the tolerance range; 

applying an alternate metric to the set of the pluraHty of 
servers to obtain a selected server when the set of the 
plurality of servers includes more than one server, and 
otherwise establishing that the selected server is the 
obtained set of the plurality of servers; and 

providing the selected server. 

33. A method for detecting load imbalance within a 
distributed system, the method comprising: 

assigning a portion metric to each one of a plurality of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 

adding the portion metric for each one of the plurality of 

servers to obtain a total portion metric; 
maintaining a number of server requests distributed to 

each one of the plurahty of servers; 
receiving a server request; 

incrementing a total number of server requests processed 
by the plurality of servens; 

computing a server request percentage for each one of the 
plurality of servers, the server request percentage for 
one of the plurahty of servers being a product of the 
number of server requests distributed to the one of the 
plurahty of servers and the total portion metric divided 
by the total number of server requests received; 

calculating a metric value for each one of the plurahty of 
servers, the metric value for one of the plurahty of 
servers being defined by the server request percentage 
for the one of the plurahty of servers and the portion 
metric for the one of the plurality of servers; 

configuring a tolerance range used in comparing the 
metric value for each one of the plurahty of servers; 

comparing the metric value for each one of the plurality 
of servers to obtain a set of the plurahty of servers, each 
one of the set of the plurality of servers having a metric 
value within the tolerance range; 

detecting load imbalance when the set of the plurahty of 
servers includes only one server; and 

applying an ahernate metric to the set of the plurality of 
servers to obtain a selected server when the set of the 
plurahty of servers includes more than one server, and 
otherwise establishing that the selected server is the 
obtained set of the plurahty of servers. 

34. The method as recited in claim 33, further including: 
generating a report in response to the detection of load 

imbalance, the report including information designed to 
assist in correcting load imbalance. 

35. The method as recited in claim 34, wherein the report 
includes the number of server requests distributed to each 
one of the plurality of servers. 

36. The method as recited in claim 33, further including: 
reassigning portion metrics for selected ones of the plu- 
rality of servers in response to the detection of load 
imbalance. 

37. The method as recited in claim 33, further including: 
reconfiguring the tolerance range in response to the detec- 
tion of load imbalance. 
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